sfla based gene selection approach for improving cancer classification accuracy

نویسندگان

jamshid pirgazi

ali reza khanteymoori

چکیده

in this paper, we propose a new gene selection algorithm based on shuffled frog leaping algorithm that is called sfla-fs. the proposed algorithm is used for improving cancer classification accuracy. most of the biological datasets such as cancer datasets have a large number of genes and few samples. however, most of these genes are not usable in some tasks for example in cancer classification. therefore, selection of the appropriate genes is important in bioinformatics and machine learning. the proposed method combines the advantage of wrapper and filter methods for gene subset selection. sfla-fs consists of two phases. in the first phase a filter method is used for gene ranking from high dimensional microarray data and in the second phase, sfla is applied to gene selection. the performance of sfla-fs evaluated for cancer classification using seven standard microarray cancer datasets. experimental results are compared with those of obtained from several existing well-known gene selection algorithm. the experimental results show that sfla-fs has a remarkable ability to generate reduced size of genes while yielding significant classification accuracy in cancer classification.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy

 In this paper, we propose a new gene selection algorithm based on Shuffled Frog Leaping Algorithm that is called SFLA-FS. The proposed algorithm is used for improving cancer classification accuracy. Most of the biological datasets such as cancer datasets have a large number of genes and few samples. However, most of these genes are not usable in some tasks for example in cancer classification....

متن کامل

SFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy

In this paper, we propose a new gene selection algorithm based on Shuffled Frog Leaping Algorithm that is called SFLA-FS. The proposed algorithm is used for improving cancer classification accuracy. Most of the biological datasets such as cancer datasets have a large number of genes and few samples. However, most of these genes are not usable in some tasks for example in cancer classification. ...

متن کامل

Improving Cancer Classification Accuracy Using Gene Pairs

Recent studies suggest that the deregulation of pathways, rather than individual genes, may be critical in triggering carcinogenesis. The pathway deregulation is often caused by the simultaneous deregulation of more than one gene in the pathway. This suggests that robust gene pair combinations may exploit the underlying bio-molecular reactions that are relevant to the pathway deregulation and t...

متن کامل

A Novel Scheme for Improving Accuracy of KNN Classification Algorithm Based on the New Weighting Technique and Stepwise Feature Selection

K nearest neighbor algorithm is one of the most frequently used techniques in data mining for its integrity and performance. Though the KNN algorithm is highly effective in many cases, it has some essential deficiencies, which affects the classification accuracy of the algorithm. First, the effectiveness of the algorithm is affected by redundant and irrelevant features. Furthermore, this algori...

متن کامل

Classification and Biomarker Genes Selection for Cancer Gene Expression Data Using Random Forest

Background & objective: Microarray and next generation sequencing (NGS) data are the important sources to find helpful molecular patterns. Also, the great number of gene expression data increases the challenge of how to identify the biomarkers associated with cancer. The random forest (RF) is used to effectively analyze the problems of large-p and smal...

متن کامل

Improving Classification Accuracy Using Gene Ontology Information

Classification problems, e.g., gene function prediction problem, are very important in bioinformatics. Previous work mainly focuses on the improvement of classification techniques used. With the emergence of Gene Ontology (GO), extra knowledge about the gene products can be extracted from GO. Such kind of knowledge reveals the relationship of the gene products and is helpful for solving the cla...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید


عنوان ژورنال:
amirkabir international journal of modeling, identification, simulation & control

ناشر: amirkabir university of technology

ISSN 2008-6067

دوره 47

شماره 1 2015

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023